1.  Introduction.


Consider a two-person zero-sum game with matrix $[a_{ij}]$ which is
being played for one unit of time. If in a time interval $\Delta t$ player 1
plays row $i$ and player 2 column $j$, the payoff to player 1 is $a_{ij}\,\Delta t$.
Payoffs are accumulated with time so that the total payoff is

$$\int_0^1 a_{i(t)j(t)}\,dt.$$

Of course, if instantaneous changes of rows or columns were allowed the
integral would not be defined. John Danskin [1] has proposed to resolve
this dilemma by approximating the game by n-stage games with perfect
information. In the n-stage approximation, player 2 is allowed to change
the column index only at times $t = k/n$, $k = 0,1,\ldots,n-1$, while player 1
is allowed to change the row index only when $t = k/n + \sigma/n$, where
$0 \le \sigma \le 1$ is a fixed parameter. The payoff is then

$$\int_0^1 a_{i(t)j(t)}\,dt = \frac{1}{n}\sum_{k=1}^{n}\left[\sigma\,a_{i_{k-1}j_k} + (1-\sigma)\,a_{i_k j_k}\right].$$

The row index $i_0$ is called the predecessor. We denote this game
$G_n(\sigma;i_0)$ and call it an n-stage $\Omega$-game with predecessor $i_0$. Its value is

$$\Omega_n(\sigma;i_0) = \min_{j_1}\max_{i_1}\cdots\min_{j_n}\max_{i_n}\ \frac{1}{n}\sum_{k=1}^{n}\left[\sigma\,a_{i_{k-1}j_k} + (1-\sigma)\,a_{i_k j_k}\right]. \tag{1.1}$$
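The nested min-max in (1.1) can be evaluated directly for small n by backward induction over the stages. The following is a minimal sketch, not part of the original report: the function name `omega_n_direct`, the 0-based indexing and the example matrix used afterwards are assumptions made for illustration.

```python
from functools import lru_cache


def omega_n_direct(a, sigma, n, i0):
    """Value of the n-stage game (1.1) by direct min-max recursion.

    a     : payoff matrix as a list of rows (0-based indices, an assumption
            of this sketch; the text uses 1-based indices)
    sigma : the delay parameter, 0 <= sigma <= 1
    n     : number of stages
    i0    : predecessor row index (0-based)
    """
    rows = range(len(a))
    cols = range(len(a[0]))

    @lru_cache(maxsize=None)
    def value(stage, prev_row):
        # Payoff still to be collected from stage `stage` through stage n.
        if stage > n:
            return 0.0
        best_for_2 = float("inf")
        for j in cols:  # player 2 picks the column first (min)
            best_for_1 = float("-inf")
            for i in rows:  # player 1 answers with a row (max)
                stage_payoff = (sigma * a[prev_row][j]
                                + (1.0 - sigma) * a[i][j]) / n
                best_for_1 = max(best_for_1, stage_payoff + value(stage + 1, i))
            best_for_2 = min(best_for_2, best_for_1)
        return best_for_2

    return value(1, i0)
```

For the illustrative matrix `[[3, 0], [1, 2]]`, the one-stage value with $\sigma = 0$ is the min-max of the matrix, and with $\sigma = 1$ it is the row minimum of the predecessor row, as the definition suggests.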

John Danskin has proved (in a far more general context) that the sequence

$$\Omega_1(\sigma;i_0),\ \Omega_2(\sigma;i_0),\ \ldots \tag{1.2}$$

converges to a limit which does not depend on the predecessor $i_0$. This
limit

$$\Omega(\sigma) = \lim_{n\to\infty}\Omega_n(\sigma;i_0)$$

is then called the $\Omega$-value of a matrix-differential game $[a_{ij}]$.

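The main purpose of this technical report is to establish the
following formula for the $\Omega$-value of a two-by-two game.

Let
$$A = \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix}$$
be a matrix, let $I = \{1,2\}$ be the index set, let $0 \le \sigma \le 1$. Denote
$\overline{M} = \max_{i\in I}\max_{j\in I}\{a_{ij}\}$, $\underline{M} = \min_{i\in I}\min_{j\in I}\{a_{ij}\}$,
$\overline{V} = \min_{j\in I}\max_{i\in I}\{a_{ij}\}$, $\underline{V} = \max_{i\in I}\min_{j\in I}\{a_{ij}\}$, and

$$v(\sigma) = \tfrac{1}{2}\left[\sigma(\underline{M}+\underline{V}) + (1-\sigma)(\overline{M}+\overline{V})\right].$$

Then

$$\Omega(\sigma) = \begin{cases} \overline{V} & \text{if } v(\sigma) \ge \overline{V}, \\ v(\sigma) & \text{if } \underline{V} < v(\sigma) < \overline{V}, \\ \underline{V} & \text{if } v(\sigma) \le \underline{V}. \end{cases} \tag{1.3}$$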


We provide two independent proofs of this formula, each using a
different approach: Shubert's (Sec. 2) or Washburn's (Sec. 3). These two
sections can be read independently; they share a common notation. A brief
comparison and discussion of the two approaches is contained in Sec. 4,
together with some ideas about possible generalization to larger games.
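Formula (1.3) amounts to clamping $v(\sigma)$ to the interval $[\underline{V}, \overline{V}]$. A minimal sketch of its evaluation follows; the function name `omega_value_2x2` and the list-of-rows representation are assumptions of this illustration, not notation from the report.

```python
def omega_value_2x2(a, sigma):
    """Evaluate formula (1.3) for a 2x2 matrix `a` given as a list of two rows."""
    entries = [a[0][0], a[0][1], a[1][0], a[1][1]]
    m_upper = max(entries)                                   # largest entry
    m_lower = min(entries)                                   # smallest entry
    v_upper = min(max(a[0][j], a[1][j]) for j in range(2))   # min over columns of column max
    v_lower = max(min(a[i][0], a[i][1]) for i in range(2))   # max over rows of row min
    v = 0.5 * (sigma * (m_lower + v_lower)
               + (1.0 - sigma) * (m_upper + v_upper))
    # The three branches of (1.3) are a clamp of v to [v_lower, v_upper].
    return min(max(v, v_lower), v_upper)
```

For the illustrative matrix `[[3, 0], [1, 2]]` one has $\overline{V} = 2$, $\underline{V} = 1$ and $v(\sigma) = (5 - 4\sigma)/2$, so the value moves linearly from 2 to 1 as $\sigma$ runs over $[1/4, 3/4]$ and is constant outside that range.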
2.  Shubert's Approach.

From (1.1) it is easily seen that the $\Omega_n(\sigma;\cdot)$ satisfy the recurrence
relation

$$\Omega_{n+1}(\sigma;k) = \min_{j\in I}\max_{i\in I}\left\{\frac{1}{n+1}\left[\sigma a_{kj} + (1-\sigma)a_{ij}\right] + \frac{n}{n+1}\,\Omega_n(\sigma;i)\right\},\quad k \in I, \tag{2.1}$$

with initial condition $\Omega_0 \equiv 0$. This relation could be used to evaluate
$\Omega(\sigma)$, which can be done on a computer provided numerical values for $\sigma$
and the entries $a_{ij}$ are given.
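As an illustration of such a computation, the recurrence can be iterated as follows. This is only a sketch: the name `omega_sequence`, the 0-based indexing and the example matrix are assumptions introduced here.

```python
def omega_sequence(a, sigma, n_stages):
    """Iterate recurrence (2.1) from Omega_0 = 0 and return Omega_{n_stages}.

    The result is a list indexed by the predecessor row (0-based).
    """
    n_rows = len(a)
    n_cols = len(a[0])
    omega = [0.0] * n_rows
    for n in range(n_stages):  # builds Omega_{n+1} from Omega_n
        weight = 1.0 / (n + 1)
        omega = [
            min(
                max(
                    weight * (sigma * a[k][j] + (1.0 - sigma) * a[i][j])
                    + (1.0 - weight) * omega[i]
                    for i in range(n_rows)
                )
                for j in range(n_cols)
            )
            for k in range(n_rows)
        ]
    return omega
```

For the illustrative matrix `[[3, 0], [1, 2]]` and $\sigma = 1/2$, both components approach 1.5 as the number of stages grows, in agreement with formula (1.3); the iteration says nothing, however, about the dependence of the limit on $\sigma$ in closed form.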

However, if we wish to obtain the entire function $\Omega(\cdot)$ and establish
some of its properties, the recurrence relation (2.1) may be of little help.
An analogy with similar problems suggests that perhaps $\Omega(\cdot)$ can also be
found as a solution of a functional equation obtained from (2.1) by letting
$n \to \infty$. Unfortunately, such a process would yield the equation

$$\Omega(\cdot;k) = \min_{j}\max_{i}\ \Omega(\cdot;i),$$

which is satisfied by any function such that $\Omega(\sigma;k) = \Omega(\sigma;i)$ for $k \ne i$.

In what follows we are going to show that the difficulty can nevertheless
be circumvented by solving the equation

$$\Omega(\sigma;k) = \min_{j\in I}\max_{i\in I}\left\{\lambda\left[\sigma a_{kj} + (1-\sigma)a_{ij}\right] + (1-\lambda)\,\Omega(\sigma;i)\right\},\quad k \in I, \tag{2.2}$$

where $0 < \lambda \le 1$ is a parameter.

We first prove that this equation has a unique solution, depending,
of course, on the parameter $\lambda$. Next we show that as $\lambda \to 0$ this solution
converges to the $\Omega$-value. As a byproduct we obtain a new proof for the
convergence of the sequence (1.2). Finally, by actually finding the solution
of (2.2) for small $\lambda > 0$ and letting $\lambda \to 0$ we establish the formula
(1.3) for the $\Omega$-value.
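The program just outlined can be tried numerically: for a fixed $\lambda$ the right-hand side of (2.2) is iterated until it stops changing. The sketch below is an illustration under stated assumptions; the names `rhs_2_2` and `solve_2_2`, the starting vector, the iteration count and the example matrix are all choices made here, and plain iteration is used only because the proofs that follow show the map to be a contraction.

```python
def rhs_2_2(a, sigma, lam, x):
    """Right-hand side of (2.2) applied to the vector x (0-based indices)."""
    size = len(a)
    return [
        min(
            max(
                lam * (sigma * a[k][j] + (1.0 - sigma) * a[i][j])
                + (1.0 - lam) * x[i]
                for i in range(size)
            )
            for j in range(size)
        )
        for k in range(size)
    ]


def solve_2_2(a, sigma, lam, iterations=5000):
    """Approximate the solution of (2.2) by repeated substitution."""
    x = [0.0] * len(a)
    for _ in range(iterations):
        x = rhs_2_2(a, sigma, lam, x)
    return x
```

With the illustrative matrix `[[3, 0], [1, 2]]`, $\sigma = 1/2$ and $\lambda = 0.01$, the two components come out slightly below and slightly above 1.5, which is the value given by (1.3). Smaller $\lambda$ needs more iterations, since the error shrinks by a factor of about $1-\lambda$ per step.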
We begin with some definitions. To simplify notation we temporarily
suppress $\sigma$ and abbreviate

$$h^k_{ij} = \sigma a_{kj} + (1-\sigma)a_{ij},$$

where $i \in I$, $j \in I$, $k \in I$.

Let $x = (x_1,x_2) \in \mathrm{Re}^2$ be a two-component vector, let $0 \le \lambda \le 1$.
Denoting

$$H^k_{ij}(\lambda,x) = \lambda h^k_{ij} + (1-\lambda)x_i,\quad i \in I,\ k \in I,\ j \in I,$$

we define a parametric family of mappings

$$T(\lambda,x) = (T_1(\lambda,x),\,T_2(\lambda,x))$$

by

$$T_k(\lambda,x) = \min_{j\in I}\max_{i\in I} H^k_{ij}(\lambda,x),\quad k \in I. \tag{2.3}$$

Let $x_0, x_1, x_2, \ldots$ be a sequence of vectors from $\mathrm{Re}^2$ defined by

$$x_0 \in \mathrm{Re}^2,\quad x_n = T\!\left(\tfrac{1}{n},\,x_{n-1}\right),\quad n = 1,2,\ldots \tag{2.4}$$

Comparing with (2.1) we see that with $x_0 = 0$

$$x_n = (\Omega_n(\sigma;1),\,\Omega_n(\sigma;2)),$$

so that $x_1, x_2, \ldots$ is indeed the sequence defining the $\Omega$-value of the
$\Omega$-game with matrix $A$.

Next, let $\|\cdot\|$ be the maximum norm on $\mathrm{Re}^2$,

$$\|x\| = \max\{|x_1|,|x_2|\},$$
let $\alpha$ be a constant such that

$$\max_{i\in I}\max_{j\in I}|a_{ij}| \le \alpha.$$

Let

$$X = \{x \in \mathrm{Re}^2 : \|x\| \le \alpha\}.$$

We are now ready to state the following theorem.


Theorem 1: Let $0 \le \lambda \le 1$, let $T$ be the mapping defined by (2.3) and
restricted to the domain $X$, let $x_1, x_2, \ldots$ be the sequence (2.4). Then
the following is true.

(1)  For every $0 < \lambda \le 1$ there is a unique fixed point

$$\bar{x}(\lambda) = T(\lambda,\bar{x}(\lambda)).$$

(2)  The limit

$$\lim_{\lambda\to 0}\bar{x}(\lambda) = \xi$$

exists and $\xi_1 = \xi_2$.

(3)  The sequence $x_1, x_2, \ldots$ converges and

$$\lim_{n\to\infty} x_n = \lim_{\lambda\to 0}\bar{x}(\lambda).$$
Proof: The proof is divided into a sequence of eight lemmas. Lemma 3
proves statement (1), Lemma 7 statement (2) and Lemma 8 statement (3).
We would like to point out that the proof may seem rather uneconomical
for the two-by-two case (for instance, Lemma 3 can be proved without
using Banach's theorem).

We nevertheless leave it in its present form with an eye on
possible generalization.

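Lemma 1: For any $0 \le \lambda \le 1$

$$x \in X \ \Rightarrow\ T(\lambda,x) \in X.$$

Proof:

$$\|T(\lambda,x)\| = \max\{|T_1(\lambda,x)|,\,|T_2(\lambda,x)|\} \le \max_{k\in I}\max_{i\in I}\max_{j\in I}\left|\lambda h^k_{ij} + (1-\lambda)x_i\right| \le \lambda\alpha + (1-\lambda)\|x\| \le \alpha \quad\text{if } x \in X.$$
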
Lemma 2: For any $0 \le \lambda \le 1$

$$\|T(\lambda,x) - T(\lambda,x')\| \le (1-\lambda)\|x-x'\|.$$

Proof: If $(f_1,f_2)$ and $(g_1,g_2)$ are any two real vectors then

$$|\max\{f_1,f_2\} - \max\{g_1,g_2\}| \le \max\{|f_1-g_1|,\,|f_2-g_2|\},$$

and

$$|\min\{f_1,f_2\} - \min\{g_1,g_2\}| \le \max\{|f_1-g_1|,\,|f_2-g_2|\}.$$

Using these two inequalities in the reversed order we obtain

$$\begin{aligned}
\|T(\lambda,x) - T(\lambda,x')\| &= \max_{k\in I}\left|\min_{j\in I}\max_{i\in I} H^k_{ij}(\lambda,x) - \min_{j\in I}\max_{i\in I} H^k_{ij}(\lambda,x')\right| \\
&\le \max_{k\in I}\max_{j\in I}\left|\max_{i\in I} H^k_{ij}(\lambda,x) - \max_{i\in I} H^k_{ij}(\lambda,x')\right| \\
&\le \max_{k\in I}\max_{j\in I}\max_{i\in I}\left|H^k_{ij}(\lambda,x) - H^k_{ij}(\lambda,x')\right| \\
&= \max_{i\in I}\left|(1-\lambda)(x_i-x_i')\right| = (1-\lambda)\|x-x'\|.
\end{aligned}$$
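The contraction estimate of Lemma 2 can be spot-checked on random pairs of vectors. The sketch below is illustrative only: `t_map`, `max_norm` and `contraction_holds` are hypothetical names, the sampling range and the rounding tolerance are arbitrary choices, and a passing check is of course no substitute for the proof.

```python
import random


def t_map(a, sigma, lam, x):
    """The mapping T(lam, x) of (2.3) for a square matrix `a` (0-based indices)."""
    size = len(a)
    return [
        min(
            max(
                lam * (sigma * a[k][j] + (1.0 - sigma) * a[i][j])
                + (1.0 - lam) * x[i]
                for i in range(size)
            )
            for j in range(size)
        )
        for k in range(size)
    ]


def max_norm(x):
    return max(abs(c) for c in x)


def contraction_holds(a, sigma, lam, trials=1000, seed=0):
    """Check ||T(x) - T(y)|| <= (1 - lam) ||x - y|| on random pairs x, y."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-5.0, 5.0) for _ in a]
        y = [rng.uniform(-5.0, 5.0) for _ in a]
        tx = t_map(a, sigma, lam, x)
        ty = t_map(a, sigma, lam, y)
        lhs = max_norm([u - v for u, v in zip(tx, ty)])
        rhs = (1.0 - lam) * max_norm([u - v for u, v in zip(x, y)])
        if lhs > rhs + 1e-12:  # small slack for floating-point rounding
            return False
    return True
```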


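Lemma 3: For every $0 < \lambda \le 1$ there is a unique $x \in X$ such that
$T(\lambda,x) = x$.

Proof: By Lemma 2 the map $T$ is for any fixed $0 < \lambda \le 1$ a contraction
and by Lemma 1 it maps a closed bounded set $X$ into itself. Hence by
Banach's fixed point theorem it must have a unique fixed point.

Lemma 4: For every $0 < \lambda \le 1$ there is $p \in I$ and $q \in I$ such that
either

$$\bar{x}_1(\lambda) = h^1_{1p},\qquad \bar{x}_2(\lambda) = h^1_{1p} + \lambda(h^2_{1q} - h^1_{1p}), \tag{2.5}$$

or

$$\bar{x}_1(\lambda) = h^2_{2q} + \lambda(h^1_{2p} - h^2_{2q}),\qquad \bar{x}_2(\lambda) = h^2_{2q}, \tag{2.6}$$

or

$$\bar{x}_1(\lambda) = \frac{h^1_{2p} + h^2_{1q} - \lambda h^2_{1q}}{2-\lambda},\qquad \bar{x}_2(\lambda) = \frac{h^1_{2p} + h^2_{1q} - \lambda h^1_{2p}}{2-\lambda}, \tag{2.7}$$

or

$$\bar{x}_1(\lambda) = h^1_{1p},\qquad \bar{x}_2(\lambda) = h^2_{2q}. \tag{2.8}$$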
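Proof: Since $x = (x_1,x_2)$ is a fixed point of the map $T(\lambda,\cdot)$ we must
have

$$x_1 = \lambda h^1_{ip} + (1-\lambda)x_i,$$

$$x_2 = \lambda h^2_{jq} + (1-\lambda)x_j,$$

where $i \in I$, $j \in I$, $p \in I$, $q \in I$. Choosing values 1 or 2 for the
index $i$ and the index $j$ we obtain four sets of pairs of linear equations:

$$x_1 = \lambda h^1_{1p} + (1-\lambda)x_1,\qquad x_2 = \lambda h^2_{1q} + (1-\lambda)x_1;$$

$$x_1 = \lambda h^1_{2p} + (1-\lambda)x_2,\qquad x_2 = \lambda h^2_{2q} + (1-\lambda)x_2;$$

$$x_1 = \lambda h^1_{2p} + (1-\lambda)x_2,\qquad x_2 = \lambda h^2_{1q} + (1-\lambda)x_1;$$

$$x_1 = \lambda h^1_{1p} + (1-\lambda)x_1,\qquad x_2 = \lambda h^2_{2q} + (1-\lambda)x_2.$$

The solution of each set gives the corresponding expression (2.5)-(2.8).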
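As a worked instance, the only pair in which the two unknowns are genuinely coupled is the one with $i = 2$, $j = 1$. Eliminating $x_2$ gives (2.7); the steps below spell this out and add nothing beyond elementary algebra.

```latex
\begin{align*}
x_1 &= \lambda h^1_{2p} + (1-\lambda)x_2, \qquad x_2 = \lambda h^2_{1q} + (1-\lambda)x_1 \\
x_1 &= \lambda h^1_{2p} + (1-\lambda)\lambda h^2_{1q} + (1-\lambda)^2 x_1 \\
\bigl[1-(1-\lambda)^2\bigr]\,x_1 &= \lambda\bigl[h^1_{2p} + (1-\lambda)h^2_{1q}\bigr] \\
\lambda(2-\lambda)\,x_1 &= \lambda\bigl[h^1_{2p} + h^2_{1q} - \lambda h^2_{1q}\bigr] \\
x_1 &= \frac{h^1_{2p} + h^2_{1q} - \lambda h^2_{1q}}{2-\lambda}
\end{align*}
```

The expression for $x_2$ follows by exchanging the roles of $h^1_{2p}$ and $h^2_{1q}$. The other three pairs decouple, since at least one of the equations involves a single unknown.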

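Lemma 5:
$$\lim_{\lambda\to 0}|\bar{x}_1(\lambda) - \bar{x}_2(\lambda)| = 0.$$

Proof: Assume the statement is false, i.e. there is $\varepsilon_0 > 0$ such that
for every $\lambda_1 > 0$ there is $0 < \lambda < \lambda_1$ for which

$$|\bar{x}_1(\lambda) - \bar{x}_2(\lambda)| \ge \varepsilon_0.$$

By Lemma 4 this could only be if $\bar{x}(\lambda)$ is given by (2.8), i.e.

$$\bar{x}_1(\lambda) = h^1_{1p},\qquad \bar{x}_2(\lambda) = h^2_{2q} \tag{2.9}$$

and

$$|h^1_{1p} - h^2_{2q}| \ge \varepsilon_0. \tag{2.10}$$

Next, since $\bar{x}(\lambda)$ is a fixed point,

$$\bar{x}_k(\lambda) = \min_{j\in I}\max_{i\in I} H^k_{ij}(\lambda,\bar{x}),\quad k \in I,$$

and because of (2.9) this implies

$$H^1_{1p}(\lambda,\bar{x}) \ge H^1_{2p}(\lambda,\bar{x}),$$

and

$$H^2_{2q}(\lambda,\bar{x}) \ge H^2_{1q}(\lambda,\bar{x}).$$

Substituting from (2.9) we obtain

$$\lambda h^1_{1p} + (1-\lambda)h^1_{1p} \ge \lambda h^1_{2p} + (1-\lambda)h^2_{2q},$$

and

$$\lambda h^2_{2q} + (1-\lambda)h^2_{2q} \ge \lambda h^2_{1q} + (1-\lambda)h^1_{1p},$$

which is equivalent to

$$h^1_{1p} - h^2_{2q} \ge \lambda(h^1_{2p} - h^2_{2q}),$$

$$h^1_{1p} - h^2_{2q} \le \lambda(h^1_{1p} - h^2_{1q}).$$

However, these two inequalities cannot hold for arbitrarily small $\lambda > 0$
unless $h^1_{1p} = h^2_{2q}$, which contradicts (2.10). The lemma is proved.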
Lemma 6: Let $f^j$ and $g^j$, $j = 1,\ldots,n$, be given vectors. Let for every
$0 < \lambda \le \lambda_1$ at least one of the $n$ inequalities $\lambda f^j \le g^j$ be satisfied.

Then there is a $\lambda_0 \in (0,\lambda_1]$ such that if the inequality
$\lambda f^{j_0} \le g^{j_0}$ is satisfied for $\lambda = \lambda_0$ it is satisfied for any $0 < \lambda \le \lambda_0$.

Proof: Since all these inequalities are linear there is, for every $j$,
a closed interval $J_j \subset [0,\lambda_1]$ such that the j-th inequality is satisfied
if and only if $\lambda \in J_j$. Although some of those intervals may be vacuous,
the system $\{J_j\}$ must cover $(0,\lambda_1]$ since by assumption at least
one of the inequalities is satisfied for any $\lambda \in (0,\lambda_1]$. Hence, there
must be at least one interval with nonempty interior and left endpoint
zero, say $J_{j_0} = [0,\lambda_0]$, $\lambda_0 > 0$, and the statement follows.


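Lemma 7: There exists $\lambda_0 > 0$, a vector $\xi = (\xi_1,\xi_2)$ and a vector
$b = (b_1,b_2)$ such that either

$$\bar{x}(\lambda) = \xi + \lambda b \quad\text{for all } 0 < \lambda \le \lambda_0, \tag{2.11}$$

or

$$\bar{x}(\lambda) = \frac{\xi - \lambda b}{2-\lambda} \quad\text{for all } 0 < \lambda \le \lambda_0. \tag{2.12}$$

In either case the vector $\xi$ is such that

$$\xi_1 = \xi_2.$$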

Proof: By Lemma 4 the fixed point $\bar{x}(\lambda)$ is given by one of the expressions
(2.5)-(2.8). Since $p$ and $q$ generally depend on $\lambda$ this leaves 16
possibilities.

Since $\bar{x}(\lambda)$ is a fixed point there is $p \in I$, $q \in I$, $r \in I$,
$s \in I$, in general depending on $\lambda$, such that

$$\bar{x}_1 = H^1_{pq}(\lambda,\bar{x}) = \min_{j\in I}\max_{i\in I} H^1_{ij}(\lambda,\bar{x}),$$

and

$$\bar{x}_2 = H^2_{rs}(\lambda,\bar{x}) = \min_{j\in I}\max_{i\in I} H^2_{ij}(\lambda,\bar{x}). \tag{2.13}$$

But $A = \min[\max(A,B),\,\max(C,D)]$ is equivalent to

$$A \ge B \ \text{ and } \ A \le C$$

or

$$A \ge B \ \text{ and } \ A \le D.$$

Hence, (2.13) is satisfied (with some specific $p$, $q$, $r$, $s$) if and only
if at least one of the resulting four systems of four inequalities between
pairs of the $H$'s is satisfied. Now, each of these inequalities is of
the form

$$\lambda h + (1-\lambda)x \ge \lambda h' + (1-\lambda)x', \tag{2.14}$$

where $h$ and $h'$ each stands for one of the $h^k_{ij}$'s and $x$ and $x'$
each for one of the two components of $\bar{x}$.

By Lemma 5 there must be $\lambda_1 > 0$ such that for $0 < \lambda \le \lambda_1$, $\bar{x}(\lambda)$
cannot be given by (2.8) unless $h^1_{1p} = h^2_{2q}$.

Hence, substituting for $x$ and $x'$ any one of the expressions
(2.5)-(2.7) or the expression (2.8) with $h^1_{1p} = h^2_{2q}$, the inequality (2.14)
becomes either

$$\lambda(h-h') + (1-\lambda)\lambda B \ge 0$$

or

$$(2-\lambda)\lambda(h-h') + (1-\lambda)\lambda B \ge 0$$

where $B$ is a constant involving the $h^k_{ij}$'s. But $\lambda$, being positive,
can be canceled and the inequalities become linear in $\lambda$.

Thus we have a finite number of systems of four linear inequalities
and since the fixed point $\bar{x}$ exists by Lemma 3 for every $0 < \lambda \le 1$, at
least one of the systems must be satisfied for every $0 < \lambda \le \lambda_1$. Hence,
by Lemma 6 there is a $\lambda_0 > 0$ such that if a particular system is satisfied
for $\lambda = \lambda_0$ it remains satisfied for all $0 < \lambda \le \lambda_0$. Consequently,
the indexes $p$, $q$, $r$, $s$, which identify the fixed point for $\lambda = \lambda_0$,
will remain unchanged as $\lambda$ decreases to zero. Hence $\bar{x}(\lambda)$ will for
every $0 < \lambda \le \lambda_0$ be given by one of the expressions (2.5)-(2.7) with $p$
and $q$ no longer changing with $\lambda$ and thus by Lemma 5 must have one of
two forms (2.11) or (2.12) with $\xi_1 = \xi_2$. The lemma is proved.


Lemma 8: Let $x_1, x_2, \ldots$ be a sequence satisfying (2.4), let
$\xi = \lim_{\lambda\to 0}\bar{x}(\lambda)$. Then the sequence $x_1, x_2, \ldots$ converges and
$\lim_{n\to\infty} x_n = \xi$.

Proof: Let $\varepsilon > 0$, let $N$ be a positive integer such that


(2.15) 


and 

ss  X  m '  k  for  every  n  ■ “-1 .  (2-16) 

k«0 

where $K = \max\{\|b\|,\ \|\xi-2b\|\,(2-\lambda_0)^{-2}\}$ and $\lambda_0$, $b$ are those of Lemma 7.

Such an $N$ exists for any $\varepsilon > 0$ since the harmonic series is Cesàro
summable to zero.

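Let $n$ be a nonnegative integer. By Lemma 2

$$\left\|x_{N+n} - \bar{x}\!\left(\tfrac{1}{N+n}\right)\right\| = \left\|T\!\left(\tfrac{1}{N+n},x_{N+n-1}\right) - T\!\left(\tfrac{1}{N+n},\bar{x}\!\left(\tfrac{1}{N+n}\right)\right)\right\| \le \frac{N+n-1}{N+n}\left\|x_{N+n-1} - \bar{x}\!\left(\tfrac{1}{N+n}\right)\right\| \tag{2.17}$$

and by the triangle inequality

$$\left\|x_{N+n-1} - \bar{x}\!\left(\tfrac{1}{N+n}\right)\right\| \le \left\|x_{N+n-1} - \bar{x}\!\left(\tfrac{1}{N+n-1}\right)\right\| + \left\|\bar{x}\!\left(\tfrac{1}{N+n}\right) - \bar{x}\!\left(\tfrac{1}{N+n-1}\right)\right\|. \tag{2.18}$$

By Lemma 7, if $0 < \lambda \le \lambda_0$ and $0 < \lambda' \le \lambda_0$ then

$$\|\bar{x}(\lambda) - \bar{x}(\lambda')\| \le |\lambda-\lambda'|\,\|b\|$$

or

$$\|\bar{x}(\lambda) - \bar{x}(\lambda')\| \le |\lambda-\lambda'|\,\|\xi-2b\|\,(2-\lambda_0)^{-2},$$

depending on which of the two expressions (2.11) and (2.12) applies.
Hence

$$\left\|\bar{x}\!\left(\tfrac{1}{N+n}\right) - \bar{x}\!\left(\tfrac{1}{N+n-1}\right)\right\| \le \frac{K}{(N+n)(N+n-1)}. \tag{2.19}$$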


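Putting (2.17), (2.18) and (2.19) together and calling
$\left\|x_{N+n} - \bar{x}\!\left(\tfrac{1}{N+n}\right)\right\| = \Delta_n$ we have

$$\Delta_n \le \frac{N+n-1}{N+n}\,\Delta_{n-1} + \frac{K}{(N+n)^2}.$$

Iterating we obtain eventually

$$\Delta_n \le \frac{N}{N+n}\,\Delta_0 + \frac{K}{N+n}\sum_{k=1}^{n}\frac{1}{N+k}$$

for every $n = 0,1,\ldots$ and hence by (2.15) and (2.16) and the obvious
fact that $0 \le \|x_n\| \le \alpha$, the radius of $X$, we have

$$\Delta_n < \varepsilon \quad\text{for every } n = 0,1,\ldots\ .$$

Thus $\lim_{n\to\infty}\Delta_n < \varepsilon$ for any $\varepsilon > 0$ so that $\lim_{n\to\infty}\left\|x_n - \bar{x}\!\left(\tfrac{1}{n}\right)\right\| = 0$ and
since by Lemma 7 $\lim_{n\to\infty}\bar{x}\!\left(\tfrac{1}{n}\right) = \xi$ the proof of Lemma 8 and also of Theorem
1 is terminated.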
Now we can proceed with the proof of the formula for the $\Omega$-value.


Theorem 2: The formula (1.3) is true.


Proof: Since always $\underline{V} \le \overline{V}$ we distinguish two cases.

Case 1: $\underline{V} < \overline{V}$

With no loss of generality assume that the matrix $A$ is such that

$$a_{11} \ge a_{22} \ge \max\{a_{12},a_{21}\},$$

relabelling the entries if necessary. Then we have

$$\overline{M} = a_{11},\quad \overline{V} = a_{22} \quad\text{and}\quad \underline{V} = \max\{a_{12},a_{21}\}.$$

We will again use the notation

$$h^k_{ij} = \sigma a_{kj} + (1-\sigma)a_{ij} \quad\text{and}\quad H^k_{ij} = \lambda h^k_{ij} + (1-\lambda)x_i.$$

Notice that in this notation

$$v(\sigma) = \tfrac{1}{2}\left(h^1_{22} + h^2_{11}\right)$$

and that if $k = i$, $h^k_{ij} = a_{ij}$.

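Case 1.1: $v(\sigma) \ge a_{22}$

Let

$$x_1 = \lambda h^1_{22} + (1-\lambda)h^2_{22},\qquad x_2 = h^2_{22}, \tag{2.20}$$

so that

$$H^1_{22}(\lambda,x) = x_1 \quad\text{and}\quad H^2_{22}(\lambda,x) = x_2. \tag{2.21}$$

Next

$$H^1_{22}(\lambda,x) - H^1_{12}(\lambda,x) = \lambda(h^1_{22}-h^1_{12}) + (1-\lambda)(x_2-x_1) = \lambda\left[h^1_{22} - h^1_{12} + (1-\lambda)(h^2_{22}-h^1_{22})\right],$$

which is nonnegative in a positive neighborhood of zero if and only if
$h^2_{22} - h^1_{12} \ge 0$. But $h^2_{22} = a_{22}$, $h^1_{12} = a_{12}$ so that indeed
$H^1_{22}(\lambda,x) - H^1_{12}(\lambda,x) \ge 0$ for small $\lambda > 0$.

Next

$$H^1_{22}(\lambda,x) - H^1_{11}(\lambda,x) = \lambda\left[h^1_{22} - h^1_{11} + (1-\lambda)(h^2_{22}-h^1_{22})\right] \le 0$$

if and only if $h^2_{22} \le h^1_{11}$, which is true since $h^2_{22} = a_{22}$ and $h^1_{11} = a_{11}$.
Thus for small $\lambda > 0$

$$H^1_{22}(\lambda,x) = \min_{j}\max_{i} H^1_{ij}(\lambda,x). \tag{2.22}$$

Similarly

$$H^2_{22} - H^2_{12} \ge 0 \iff 2h^2_{22} - (h^2_{12}+h^1_{22}) \ge 0 \iff 2a_{22} \ge a_{22} + a_{12},$$

and

$$H^2_{22} - H^2_{11} \le 0 \iff 2h^2_{22} - 2v(\sigma) \le 0,$$

which is true since $h^2_{22} = a_{22}$. Hence, for small $\lambda > 0$

$$H^2_{22}(\lambda,x) = \min_{j}\max_{i} H^2_{ij}(\lambda,x),$$

which together with (2.21) and (2.22) means that for all $\lambda > 0$ in some
neighborhood of zero (2.20) is a fixed point of the mapping (2.3).
Consequently by Theorem 1 the $\Omega$-value

$$\Omega(\sigma) = \xi_1 = h^2_{22} = a_{22}.$$

Case 1.2: $v(\sigma) \le \underline{V}$

Let $p \in I$ and $q \in I$ be such that $\underline{V} = a_{pq}$. Notice that $p \ne q$
so that

$$v(\sigma) = \tfrac{1}{2}\left[h^p_{qq} + h^q_{pp}\right].$$

Let

$$x_p = h^p_{pq},\qquad x_q = \lambda h^q_{pp} + (1-\lambda)h^p_{pq}, \tag{2.23}$$

so that

$$x_p = H^p_{pq}(\lambda,x) \quad\text{and}\quad x_q = H^q_{pp}(\lambda,x). \tag{2.24}$$

Similarly as in Case 1.1,

$$H^p_{pq} - H^p_{qq} \ge 0 \iff 2h^p_{pq} - 2v(\sigma) \ge 0,$$

which is true since $h^p_{pq} = a_{pq} = \underline{V}$. Next

$$H^p_{pq} - H^p_{pp} = \lambda(h^p_{pq} - h^p_{pp}) = \lambda(a_{pq} - a_{pp}) \le 0,$$

so that

$$H^p_{pq} = \min_{j}\max_{i} H^p_{ij}. \tag{2.25}$$

Similarly

$$H^q_{pp} - H^q_{qp} \ge 0 \iff h^p_{pq} - h^q_{qp} \ge 0$$

since

$$h^p_{pq} = a_{pq} = \underline{V} \ge \underline{M} = a_{qp} = h^q_{qp},$$

and

$$H^q_{pp} - H^q_{qq} \le 0 \iff h^p_{pq} - h^q_{qq} \le 0$$

since $a_{pq} \le a_{qq}$. Hence

$$H^q_{pp} = \min_{j}\max_{i} H^q_{ij} \tag{2.26}$$

and (2.23)-(2.26) imply $\Omega(\sigma) = h^p_{pq} = \underline{V}$.

Case 1.3: $\underline{V} < v(\sigma) < \overline{V}$

Calling again $\underline{V} = a_{pq}$ we now take

$$x_p = \frac{1}{2-\lambda}\left(2v(\sigma) - \lambda h^q_{pp}\right),\qquad x_q = \frac{1}{2-\lambda}\left(2v(\sigma) - \lambda h^p_{qq}\right). \tag{2.27}$$

We have now

$$x_p = H^p_{qq}(\lambda,x),\qquad x_q = H^q_{pp}(\lambda,x). \tag{2.28}$$

Next

$$H^p_{qq} - H^p_{pq} = \lambda\left[h^p_{qq} - h^p_{pq} + \frac{1-\lambda}{2-\lambda}\left(h^q_{pp} - h^p_{qq}\right)\right]$$

so that for small $\lambda > 0$

$$H^p_{qq} - H^p_{pq} \ge 0 \ \text{ if and only if } \ 2(h^p_{qq} - h^p_{pq}) + h^q_{pp} - h^p_{qq} \ge 0.$$

But this is equivalent to

$$2v(\sigma) - 2h^p_{pq} \ge 0,$$

which is true since $h^p_{pq} = a_{pq} = \underline{V}$. Similarly

$$H^p_{qq} - H^p_{pp} \le 0 \iff 2(h^p_{qq} - h^p_{pp}) + h^q_{pp} - h^p_{qq} \le 0 \iff 2v(\sigma) - 2h^p_{pp} \le 0$$

since $h^p_{pp} = a_{pp} \ge \overline{V}$ and $v(\sigma) < \overline{V}$. Thus

$$H^p_{qq} = \min_{j}\max_{i} H^p_{ij}. \tag{2.29}$$

Finally

$$H^q_{pp} - H^q_{qp} \ge 0 \iff 2(h^q_{pp} - h^q_{qp}) + h^p_{qq} - h^q_{pp} \ge 0 \iff 2v(\sigma) - 2a_{qp} \ge 0$$

since $a_{qp} = \underline{M}$, and

$$H^q_{pp} - H^q_{qq} \le 0 \iff 2(h^q_{pp} - h^q_{qq}) + h^p_{qq} - h^q_{pp} \le 0 \iff 2v(\sigma) - 2h^q_{qq} \le 0$$

since $v(\sigma) < \overline{V} \le a_{qq}$. Hence

$$H^q_{pp} = \min_{j}\max_{i} H^q_{ij}, \tag{2.30}$$

and (2.27)-(2.30) yield

$$\Omega(\sigma) = v(\sigma).$$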
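The three candidate fixed points (2.20), (2.23) and (2.27) can be checked numerically against the mapping (2.3). The sketch below is only an illustration: the names `t_map` and `candidate_fixed_point` are hypothetical, indices are 0-based, the matrix is assumed to be already arranged so that $a_{11} \ge a_{22} \ge \max\{a_{12},a_{21}\}$ with no saddle point, and the check is meaningful only for $\lambda$ small enough that the inequalities of the proof hold.

```python
def t_map(a, sigma, lam, x):
    """The mapping T(lam, x) of (2.3) for a 2x2 matrix `a` (0-based indices)."""
    return [
        min(
            max(
                lam * (sigma * a[k][j] + (1.0 - sigma) * a[i][j])
                + (1.0 - lam) * x[i]
                for i in range(2)
            )
            for j in range(2)
        )
        for k in range(2)
    ]


def candidate_fixed_point(a, sigma, lam):
    """Closed-form candidates (2.20), (2.23), (2.27) of Cases 1.1-1.3.

    Assumes a[0][0] >= a[1][1] >= max(a[0][1], a[1][0]) and no saddle point.
    """
    def h(k, i, j):  # h^k_{ij} in the notation of the text
        return sigma * a[k][j] + (1.0 - sigma) * a[i][j]

    v = 0.5 * (h(0, 1, 1) + h(1, 0, 0))
    v_upper = a[1][1]
    v_lower = max(a[0][1], a[1][0])
    if v >= v_upper:  # Case 1.1
        return [lam * h(0, 1, 1) + (1.0 - lam) * h(1, 1, 1), h(1, 1, 1)]
    # p is the row of the off-diagonal entry equal to v_lower, q the other index
    p, q = (0, 1) if a[0][1] >= a[1][0] else (1, 0)
    x = [0.0, 0.0]
    if v <= v_lower:  # Case 1.2
        x[p] = h(p, p, q)
        x[q] = lam * h(q, p, p) + (1.0 - lam) * h(p, p, q)
    else:  # Case 1.3
        x[p] = (2.0 * v - lam * h(q, p, p)) / (2.0 - lam)
        x[q] = (2.0 * v - lam * h(p, q, q)) / (2.0 - lam)
    return x
```

For the illustrative matrix `[[3, 0], [1, 2]]` and $\lambda = 0.01$, the values $\sigma = 0.1$, $0.5$ and $0.9$ fall into Cases 1.1, 1.3 and 1.2 respectively, and in each case applying `t_map` to the candidate returns the candidate itself up to rounding.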

Case 2: $\underline{V} = \overline{V}$

Although we could use the same procedure as in Case 1 again we prefer
a simpler argument. If $\underline{V} = \overline{V} = V$ then the matrix $A$ has a saddle, say
$a_{k\ell} = V$. Since every n-stage $\Omega$-game $G_n(\sigma;k)$, being a game with perfect
information, has a pure value we must have

$$\Omega_n(\sigma;k) = V \quad\text{for every } n = 1,2,\ldots$$

But by Theorem 1 $\lim_{n\to\infty}\Omega_n(\sigma;i) = \Omega(\sigma)$ regardless of $i \in I$. Hence

$$\Omega(\sigma) = V \quad\text{for all } 0 \le \sigma \le 1$$

and the theorem is proved.
3.  Washburn's Approach.

We will proceed to prove formula (1.3) by showing that Player I
can get at least $\Omega^{I}(\sigma) + O(1/n)$, that Player II can prevent Player I
from getting more than $\Omega^{II}(\sigma) + O(1/n)$, and that $\Omega^{I}(\sigma) = \Omega^{II}(\sigma) =
\Omega(\sigma)$. The two functions $\Omega^{I}(\sigma)$ and $\Omega^{II}(\sigma)$ will be defined within the
theorem. For future reference, let the clockwise and counterclockwise
averages be

$$v_c(\sigma) = .5\left((a_{11}+a_{22})(1-\sigma) + (a_{12}+a_{21})\sigma\right),$$

and

$$v_{cc}(\sigma) = .5\left((a_{11}+a_{22})\sigma + (a_{12}+a_{21})(1-\sigma)\right).$$


Theorem: Formula (1.3) is true.


Proof: We will show that players I and II can asymptotically confine themselves
to four strategies each, those strategies being the four functions
mapping $\{1,2\}$ into itself. For example, the strategy (2,1) for I
means that I will always choose 2 if II has previously chosen 1, or
1 if II has previously chosen 2. We will use the symbol $s \xrightarrow{I} x$ to mean
that the strategy $s$ for I assures that the average payoff to I will be
at least $x + O(1/n)$, where $n$ is the number of moves, regardless of
the starting state. Similarly for II, with the symbol $s \xrightarrow{II} x$.

We proceed by showing that
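$$\begin{aligned}
(1,2) &\xrightarrow{I} \min\{v_c(\sigma),a_{11},a_{22}\} \\
(2,1) &\xrightarrow{I} \min\{v_{cc}(\sigma),a_{12},a_{21}\} \\
(1,1) &\xrightarrow{I} \min\{a_{11},a_{12}\} \\
(2,2) &\xrightarrow{I} \min\{a_{21},a_{22}\} \\
(1,2) &\xrightarrow{II} \max\{v_{cc}(\sigma),a_{11},a_{22}\} \\
(2,1) &\xrightarrow{II} \max\{v_c(\sigma),a_{12},a_{21}\} \\
(1,1) &\xrightarrow{II} \max\{a_{11},a_{21}\} \\
(2,2) &\xrightarrow{II} \max\{a_{12},a_{22}\}.
\end{aligned}$$

For example, consider the strategy (1,2) for I. After $n$ stages, let
$n^{I}_{ij}$ ($n^{II}_{ij}$) be the number of visits to state $(i,j)$ initiated by I (II).
Then as long as I uses (1,2), regardless of the starting point,

$$\begin{aligned}
\left|n^{I}_{11} - n^{II}_{21} - n^{II}_{11}\right| &\le 1 \\
\left|n^{I}_{22} - n^{II}_{12} - n^{II}_{22}\right| &\le 1 \\
\left|n^{II}_{21} - n^{II}_{12}\right| &\le 1 \\
n^{I}_{12} &= 0 \\
n^{I}_{21} &= 0 \\
\sum_{i,j} n^{I}_{ij} &= n.
\end{aligned}$$

Since the average payoff to I is

$$v_{(1,2)} = \frac{1}{n}\sum_{i,j}\left[n^{I}_{ij}(1-\sigma) + n^{II}_{ij}\,\sigma\right]a_{ij},$$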
a lower bound on $v_{(1,2)}$ can be obtained by solving a linear programming
problem. Furthermore, if we define

$$p^{I}_{ij} = n^{I}_{ij}/n \quad\text{and}\quad p^{II}_{ij} = n^{II}_{ij}/n$$

and neglect terms that are $O(1/n)$, the program is

$$\min\ v_{(1,2)} = \sum_{i,j}\left[p^{I}_{ij}(1-\sigma) + p^{II}_{ij}\,\sigma\right]a_{ij}$$

subject to

$$\begin{aligned}
p^{I}_{11} &= p^{II}_{21} + p^{II}_{11} \\
p^{I}_{22} &= p^{II}_{12} + p^{II}_{22} \\
p^{II}_{21} &= p^{II}_{12} \\
p^{I}_{21} &= p^{I}_{12} = 0 \\
\sum_{i,j} p^{I}_{ij} &= 1.
\end{aligned}$$

There are six equations in eight variables. There is an optimal solution
in which at least two of the variables, in addition to $p^{I}_{21}$ and $p^{I}_{12}$,
are 0. If these two variables are $p^{II}_{11}$ and $p^{II}_{22}$, then $p^{I}_{11} = p^{I}_{22} =
p^{II}_{12} = p^{II}_{21} = .5$ and $v_{(1,2)}$ is $v_c(\sigma)$. If $p^{I}_{11} = 0$, $v_{(1,2)} = a_{22}$,
and if $p^{I}_{22} = 0$, $v_{(1,2)} = a_{11}$. If $p^{II}_{12} = 0$, $v_{(1,2)}$ is some mixture of
$a_{11}$ and $a_{22}$. In all cases, $(1,2) \xrightarrow{I} \min\{v_c(\sigma),a_{11},a_{22}\}$. Essentially,
II has his choice of either going around in a clockwise circle or else
accepting one of the two payoffs that I keeps trying to obtain. The rest
of the $\rightarrow$ statements can be obtained similarly.

Taking account of all four of I's (II's) strategies, we see that
I (II) can guarantee that the payoff will be at least (at most)
$\Omega^{I}(\sigma) + O(1/n)$ ($\Omega^{II}(\sigma) + O(1/n)$), where

$$\Omega^{I}(\sigma) = \max\{\min\{v_{cc}(\sigma),a_{12},a_{21}\},\ \min\{v_c(\sigma),a_{11},a_{22}\},\ \underline{V}\}$$

$$\Omega^{II}(\sigma) = \min\{\max\{v_{cc}(\sigma),a_{11},a_{22}\},\ \max\{v_c(\sigma),a_{12},a_{21}\},\ \overline{V}\}$$

The fact that $\Omega^{I}(\sigma) = \Omega^{II}(\sigma) = \Omega(\sigma)$ can be shown by exhaustion of cases.
Since $\Omega^{I}(\sigma) \ge \underline{V}$ and $\Omega^{II}(\sigma) \le \overline{V}$, the proof is trivial if $\underline{V} = \overline{V}$. If
there is no saddle point, suppose $a_{11}$ or $a_{22}$ is the largest element
$\overline{M}$. It follows that the term involving $v_{cc}(\sigma)$ may be deleted from both
$\Omega^{I}(\sigma)$ and $\Omega^{II}(\sigma)$, that the second largest element $\overline{V}$ is on the same
diagonal, and also that $v(\sigma) = v_c(\sigma)$. The result is that

$$\Omega^{I}(\sigma) = \max\{\min\{v(\sigma),\overline{V}\},\underline{V}\}$$

$$\Omega^{II}(\sigma) = \min\{\max\{v(\sigma),\underline{V}\},\overline{V}\}$$

Both of these expressions are equivalent to (1.3). A similar proof holds
if $a_{12}$ or $a_{21}$ is the largest element.

QED
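The exhaustion of cases can be mirrored by a small numerical sweep that compares $\Omega^{I}$, $\Omega^{II}$ and formula (1.3) on a grid of matrices. The sketch below is an illustration only; the function names, the grid of integer entries and the chosen values of $\sigma$ are assumptions made here, and agreement on a finite grid does not replace the argument above.

```python
from itertools import product


def omega_bounds(a, sigma):
    """Return (Omega^I, Omega^II) as defined in the proof above."""
    (a11, a12), (a21, a22) = a
    v_c = 0.5 * ((a11 + a22) * (1.0 - sigma) + (a12 + a21) * sigma)
    v_cc = 0.5 * ((a11 + a22) * sigma + (a12 + a21) * (1.0 - sigma))
    v_lower = max(min(a11, a12), min(a21, a22))
    v_upper = min(max(a11, a21), max(a12, a22))
    omega_1 = max(min(v_cc, a12, a21), min(v_c, a11, a22), v_lower)
    omega_2 = min(max(v_cc, a11, a22), max(v_c, a12, a21), v_upper)
    return omega_1, omega_2


def formula_1_3(a, sigma):
    """Formula (1.3): v(sigma) clamped to the interval [V_lower, V_upper]."""
    (a11, a12), (a21, a22) = a
    entries = (a11, a12, a21, a22)
    v_lower = max(min(a11, a12), min(a21, a22))
    v_upper = min(max(a11, a21), max(a12, a22))
    v = 0.5 * (sigma * (min(entries) + v_lower)
               + (1.0 - sigma) * (max(entries) + v_upper))
    return min(max(v, v_lower), v_upper)


def bounds_agree_on_grid(max_entry=3, sigmas=(0.0, 0.25, 0.5, 0.75, 1.0)):
    """Compare the three quantities for all 2x2 matrices with small integer entries."""
    for a11, a12, a21, a22 in product(range(max_entry + 1), repeat=4):
        a = [[a11, a12], [a21, a22]]
        for sigma in sigmas:
            lower, upper = omega_bounds(a, sigma)
            target = formula_1_3(a, sigma)
            if abs(lower - target) > 1e-12 or abs(upper - target) > 1e-12:
                return False
    return True
```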
4.  Comparison of Techniques and Prognosis for Larger Games.

The 2x2 game would be much easier to solve if it could be
discovered that the function $\Omega(\sigma)$ satisfies some simple functional
equation that has a unique solution. This does not seem to be the case.
Shubert deals with this problem by inventing a situation that is almost
as good. He introduces a function $\Omega(\sigma,\lambda,k)$ ($\bar{x}_k(\lambda)$ in his notation)
which does satisfy such a functional equation, and then shows that
$\lim_{\lambda\to 0}\Omega(\sigma,\lambda,k) = \Omega(\sigma)$ regardless of the index $k$. He next shows there
are only a finite number of candidates for the solution of the equation,
one of which must apply for all $\lambda$ smaller than some positive $\lambda_0$.
Given the matrix $A$, he finds the solution, and consequently $\Omega(\sigma)$.
This procedure is probably generalizable to larger games. The difficulty
will be that the number of candidate solutions will grow very fast with
the size of the matrix. It is possible, however, that a systematic
procedure can be found to determine the correct solution without an
exhaustive search.

Washburn uses a time-honored technique for solving two-person
zero-sum games: he guesses the optimal strategies and shows that they are
in equilibrium. The generalization of his procedure when I has m
strategies and II has n strategies would be

1)  Let I (II) confine himself to all functions mapping $\{1,\ldots,n\,(m)\}$
into $\{1,\ldots,m\,(n)\}$. There are $m^n$ ($n^m$) such strategies.

2)  For each such strategy, find the asymptotic bound that it guarantees
for the player who is using it by writing a Linear Program with $2mn$
variables.

3)  Show that the greatest lower bound equals the least upper bound
(a theorem is required).

Given the theorem in 3), one could find the $\Omega$-value of an m x n game
for one particular value of $\sigma$ by solving $\min\{m^n,n^m\}$ Linear Programs
with $2mn$ variables each.
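To give a feeling for how quickly this workload grows, the counts quoted above can be tabulated. The helper below is a hypothetical illustration (the name `washburn_workload` is not from the report); it only evaluates $\min\{m^n, n^m\}$ and $2mn$.

```python
def washburn_workload(m, n):
    """Number of Linear Programs and variables per program for an m x n game."""
    strategies_for_1 = m ** n  # functions from {1..n} into {1..m}
    strategies_for_2 = n ** m  # functions from {1..m} into {1..n}
    return {
        "programs": min(strategies_for_1, strategies_for_2),
        "variables_per_program": 2 * m * n,
    }
```

For the 2x2 game this gives four programs in eight variables each, matching the count in Sec. 3; already for a 10x10 game the number of programs is $10^{10}$.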

It is apparent that either method for finding the $\Omega$-value of a
large game would require a vast amount of computational effort, even if
all the required theorems can be proved. The central problem for
matrix-differential games would therefore seem to be the invention of a practical
procedure for finding solutions; it would be particularly valuable if
the procedure were able to find solutions for all $\sigma$. By "practical,"
we mean a procedure where computational effort increases less than
exponentially with m and n.